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A GENERAL IMPLICIT/INVERSE FUNCTION THEOREM 


BRUCE BLACKADAR 


Abstract. The Implicit and Inverse Function Theorems are special cases of 
a general Implicit/Inverse Function Theorem which can be easily derived from 
either theorem. The theorems can thus be easily deduced from each other via 
the generalized version. 


1. Introduction 

The Implicit and Inverse Function Theorems are the deepest results from dif¬ 
ferential calculus, and among the most important; they are the primary means of 
passing from the infinitesimal to the local in analysis and geometry. Although they 
are distinct results, each is a relatively easy corollary of the other, so one only needs 
to carry out the (rather difficult) proof of one of them to obtain both. They can 
both be regarded as aspects of the general problem of solving systems of functional 
equations; existence and uniqueness theorems for solutions to differential equations 
fall into the same class of results. 

These theorems have a long and complicated history, going back at least to 
Descartes (although for a long time it was not always appreciated that they had 
to be proved), and there are many versions. The Implicit Function Theorem 
(EU) was first proved by DiNi in the 1870’s, although Dini’s contributions were 
not properly recognized for a long time since he published them only in his lecture 
notes, which had a limited circulation. For continuous and higher-order smooth 
versions, see |Gou03) and |You09| . The book |KP02) contains the most thorough 
treatment available of the various forms of the Implicit and Inverse Function The¬ 
orems (although rather incredibly this book does not contain a complete correct 
proof of the basic Implicit Function Theorem as stated in 12.11 or in most Real 
Analysis texts). 

It is well known that the two theorems can be easily deduced from each other, 
and the arguments can be found in a number of Real Analysis texts. But it seems 
not to be well known that there is actually a general theorem of which both the 
Implicit and Inverse Function Theorems are special cases (a version of this general 
theorem appeared in |Nij74| ), and that the general theorem can be easily proved 
from either standard theorem essentially by the usual arguments for deriving the 
other theorem. Thus, once one of the theorems is proved, it is most natural to 
derive the other by stating and proving the general version. The aim of this article 
is to state the general version (Theorem I3.1|l and show how to obtain it from the 
usual ones. 
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2. Statements of the Theorems 

The idea of the theorems is that if one has n equations in m unknowns, then 
there is “usually” a unique solution if n = m, many solutions if n < m, and no 
solutions if n > m; if n < TO, it should be possible to solve for n of the variables as 
functions of the others (specifying values for to — n of the variables yields a system 
of n equations in the remaining n variables). By elementary Linear Algebra, these 
statements are literally true if the equations are linear unless there is degeneracy 
in the system. Since smooth functions are “approximately linear,” it might also be 
possible (and in fact is) to do the same locally in the smooth case. 

Vector notation is a convenient way to rephrase systems of equations. An equa¬ 
tion in unknowns xi,..., Xm can, by moving everything to one side of the equation, 
be written as F{xi,... ,Xm) = 0, where F : K"* —>■ R is a function. A set of n 
equations 

Fl (xi, . . . , Xm) — 0 

F 2 {xi, . . .,Xm) = 0 

Fn (xi, . . . , Xm) — 0 

can be written as F{x \,..., Xm) = 0, or F{x) = 0, where F is the function from R™ 
(or a subset) to R” with coordinate functions (Fi,..., F„), and x = {x\,. ■ ■ ^ Xm)- 

In the case to > n, it is convenient to change notation slightly by setting p = 
m — n. We would like to be able to solve the system F{x) — 0 for n of the variables 
in terms of the other p variables. (Here the relative sizes of n and p do not matter; 
we could have n < p, n = p, or n > p.) We can in general choose any n of the 
variables to solve for in terms of the other ones; we will notationally choose the last 
n and rename them j/i,..., ?/„, i.e. our equation to solve is of the form 

F{xi,. ..,Xp,yi,.--,yn)=0 

or F{x, y) = 0, where x = {xi,..., Xp) € W and y = {yi ,..., y„) S R". A solution 
on a subset W of R^ is a function f : W ^ R” such that the pair [x, y) for y = f{x) 
satisfies the equation for all x e W, i.e. F(x, f(x)) = 0 for all a: G IT. We say the 
function f is implicitly defined by the equation F{x,y) = 0. It is rarely possible 
to find an explicit formula for an implicitly defined function in the nonlinear case. 

Examples with n = p = 1 from elementary calculus show that one can only 
expect the equation F{x,y) = 0 to define y as a continuous function of x locally, 
i.e. in some neighborhood of a point a where (a, b) satisfies the equation, and 
then only around points where there is some nondegeneracy condition on F. If 
n = p = 1 and F is smooth, the graph of the equation F{x, y) = 0 is a curve in R^, 
and this curve is in general only locally the graph of a smooth function, and not 
even necessarily around every point of the curve; points where the curve crosses 
itself or has a vertical tangent (or has even more complicated local behavior) must 
be excluded. 

In the statements of the theorems, we will use the following notation. Let F be 
a function from an open set U in Rp+” to R". Write the coordinates of R^’+” as 
(xi,.. .,Xp,yi ,... ,y„), or {x,y) with x = (xi,... ,Xp) G R^ and y = (yi,... ,y„) G 
R” as above, and let Fi,..., F„ be the coordinate functions of F. So, in coordinates, 

F{x,y) = (Fi(x,y),... ,F„(x,y)) 

= (Fi(xi,...,Xp,yi,...,y„),...,F„(xi,...,Xp,yi,...,y„))) . 
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Let {x,y) G U, and suppose all partial derivatives of the form exist. 

Write 



DyF{x,y) = 



and, if all partial derivatives of the form ^^{x,y) exist, write 




DxF{x,y) = 




(note that DyF{x, y) is n x n and DxF{x, y) is n x p if they are defined). Then if 
F is differentiable at {x, y), we have that the n x (p + n) Jacobian matrix DF{x, y) 
partitions as 


DF{x,y)= D^F{x,y) \ DyF{x,y) 


Theorem 2.1. [Implicit Function Theorem] Let U be an open set in 
and (a, b) G U. Let F : U ^ K" with F{a, b) — 0. Suppose F is C on U for some 
T, 1 < r < oo. If DyF{a,b) is invertible, i.e. det{DyF{a,b)) 0, then there are: 

(i) An open set V in containing (a, b), with V , such that DyF{x, y) 

is invertible for all {x, y) G V; 

(ii) An open set W in containing a; 

(iii) A unique function f : W ^ R" such that f{a) = b and, for all x G W, 


{x, f{x)) G V and 

F{x,f{x)) = 0 . 

Additionally, f is on W, and for any x G W, we have 


Df(x) = -DyF{x, f{x)) ^D,,F{x, f{x)) 


(in particular, DyF{x, f{x)) is invertible for all x G W). 

Theorem 2.2. [Inverse Function Theorem] Let Lf be an open set in R", b G U, 
and f . U ^ R" a C function for some r, 1 < r < oo. If Df(b) is invertible, then 
there is a neighborhood V of b, V Q U, such that 

(i) / is one-to-one on V. 

(ii) f(y) is open in R". 

(iii) Df{x) is invertible for all x gV. 

(iv) f~^ is C'’ on f{V) and, for all y G fiV), 


Dif-^){y) = [Df{f-\y))]-^ . 


We will not discuss proofs of the individual theorems here (there are several 
standard ones). See [BE] (or many other Real Analysis texts) for a full discussion. 










4 


BRUCE BLACKADAR 


3. Twin Theorems and a Generalization 

The Inverse Function Theorem and Implicit Function Theorem are twin theorems 
in the sense that each is easily derivable from the other; the arguments are much 
easier than proving either theorem from scratch. In fact, either argument can be 
used to prove a common generalization: 


Theorem 3.1. [General Implicit/Inverse Function Theorem] Let n e N, 
p G N U {0}, a G b,c € R", U an open neighborhood of {a,b) in Rp+", 
F : U ^ R” a C’' function (I < r < oo) with F{a, b) = c and DyF{a, b) invertible. 
Then there are: 

(i) An open neighborhood V of {a,b) in RP’*'", contained in U, such that 
DyF{x,y) is invertible for all {x,y) G V; 

(ii) An open neighborhood W of (a, c) in R^’+"'; 

(iii) A unique function G : IF —>■ R” satisfying G{a, c) = b and, for all {x, z) G 
IF, (x, G(x, z)) G V and 

F{x,G{x,z)) = z . 

Additionally, G is C’’ on IF, and for all {x, z) G IF, 


DG{x, z) 


-DyF{x,G{x,z)) ^DxF(x,G{x,z)) 


DyF(x,G{x,z)) 1 


Note that the Inverse Function Theorem is the case p = 0 (so the a is absent). 
The Implicit Function Theorem is a corollary by taking c = 0 and setting f{x) = 
G{x,0). In fact, the Implicit Function Theorem says that for certain fixed 2 : near 
c, there is a function f implicitly defined near a by F(x, y) = z, which is C’' in x\ 
we have G{x, z) = f,^{x), and the additional content of the above theorem is that 
the are defined for all 2 ; near c and “vary smoothly” in z. Thus Theorem 13.II is 
a “parametrized version” of both the Implicit and Inverse Function Theorems. 

The situation is symmetric in F and G. G satisfies the hypotheses of the the¬ 
orem, so there is a C” function H defined near {a,b) for which H{a,b) = c and 
G{x,H{x,y)) — y for {x,y) near {a,b). By the formula, DH{x,y) = DF{x,y) 
for all {x, y) near (a, b), and since II(a, b) ~ F{a, b), we have H = F on a neigh¬ 
borhood of {a,b), i.e. H = F (perhaps restricted to a smaller neighborhood of 
(a, 6)). In particular, we have G{x, Fix, y)) = y for {x, y) near (a, b) (this can be 
verified directly). 

We will give simple proofs of 13.11 from both the Inverse Function Theorem and 
the Implicit Function Theorem. 

Proof Prom Inverse Function Theorem. Assume the Inverse Function Theo¬ 
rem [521 We will deduce [XT] 

Suppose U C RP+" is open, (a, b) G U, and F : G —>■ R” is a C” function with 
F{a,b) = c and DyF{a,b) invertible. Define 

F{x,y) = {x,F{x,y)) 
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for {x, y) G U. Then ^ is a C’' function from U to RP+”. We have 

Ip 0 

D^Flx,y) DyF{x,y) 

for {x,y) G U] thus DF{a,b) is invertible. So by the Inverse Function Theorem 
there is a neighborhood V of (a, b) on which F is one-to-one, W = F(V) is open, 
and the inverse function $ : bF —>■ is C. Since F{a,b) = (a,c), we have 
(a, c) G VF and $(a, c) = (a, b). 

Write 5>(a;,2) = {H{x, z),G{x, z)), where H : W ^ W and G : W ^ R’". 
Then H and G are C*", and G{a, c) = b. Since for (a:, 2 ;) G IF we have 

{x,z) = F{d>{x,z)) = F{H{x,z),G{x,z)) = {H{x, z), F{H{x, z),G{x, z))) 

we obtain that H{x, z) = x and 

F{H{x,z),G{x,z)) = F{x,G{x,z)) = z 

for {x, z) G IF. 

Uniqueness of G follows from the fact that F is one-to-one: if G is a function 
from IF to M" with {x,G{x, z)) G V and F{x,G{x, z)) = z for all {x,z) G IF, 
then for {x, 2 ;) G IF we have 

F{x, G{x, z)) = {x, z) = F{x, G{x, z)) 

so G{x, z) = G{x, z). 

Thus 13.il ls proved (the derivative formula is a straightforward exercise using the 
Chain Rule). 

Proof Fbom Implicit Function Theorem. Now assume the Implicit Function 
Theorem 12.II holds. We will deduce l3Tl 

Suppose U C IR.P+" is open, (a, b) G t/, and F : U ^ K" is as in the statement 
of EH Set c = F{a, b). Write Rp+^^ ^ Rp x K" x R”, and set 

U' = {{x,z,y)eW+^^ : ix,y)€U} . 

Then U' is open in Rp+^"’, and (a, c, b) £ U'. Define F : U' ^ R" by 

F{x,z,y) = F{x,y) - z . 

Then F is C’’ on [/', F{a, c, b) = 0, and 

DF{x,z,y) = [ D^f{x,y) DyF{x,y) ] 

for (a:, 2 ;, y) G U', and in particular DyF{a, c, b) = DyF{a, b) is invertible. 

We can thus solve the equation F{x,z,y) = 0 implicitly for y as a function of 
(a;, z), i.e. there are open neighborhoods IF of (a, c) and V of (a, b) and a unique 
function G : IF —>■ R" such that G(x,z) G V and F{x, z,G{x, z)) = 0 for all 
(a;, z) G IF, and G is C’’. But F{x, z, G{x, z)) = 0 is equivalent to F{x, G{x, z)) = 
z. 

Thus 13.11 is proved (the derivative formula is again a straightforward exercise). 


DF{x,y) = 
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4. Weakening the Differentiability 

Various versions of the Implicit and Inverse Function Theorems have been given 
over the years. The theorems hold verbatim with Euclidean spaces replaced by 
Banach spaces. For the Inverse Function Theorem, the continuous differentiability 
hypothesis on / can be relaxed to 

(i) strong differentiability at a with invertible derivative there ( |Lea6I) . [Kni88) l 

or 

(ii) differentiability in a neighborhood of a with i nvertibl e derivative every¬ 
where in the neighborhood ( |Ray02| , implicit in |Cer ii]; cf. [Tau] l. 

In each of these cases we get a corresponding Generalized Implicit/Inverse Func¬ 
tion Theorem by the same argument as in the proof of 13.11 and thus an Implicit 
Function Theorem: 


Theorem 4.1. (cf. |Nij74| j Let n S N, p G N U {0}, a G Rp, b,c G R", U an 
open neighborhood of {a,b) in RP’*'", F : U ^ R" a function which is strongly 
differentiable at {a,b), with F{a,b) = c and DyF{a,b) invertible. Then there are: 

(i) An open neighborhood V of {a,b) in Rp+”, contained in U; 

(ii) An open neighborhood W of (a, c) in RP+"'; 

(hi) A unique function G : IF — >■ R" satisfying G{a, c) = b and, for all {x, z) G 
IF, (x, G(x, z)) G V and 

F{x,G{x,z)) = z . 

Additionally, G is strongly differentiable at (a,c), and 


DG{a, c) 


-DyF{a,b)-^D,,Fia,b) DyF{a,b)-^ 


Theorem 4.2. Let n G N, p G NU{0}, a G Rp, b,c G R”, Lf an open neighborhood 
of {a, b) in Rp+", F : U ^ R" an everywhere differentiable function with F{a, b) = 
c and DyF{x,y) invertible for all {x,y) G Lf. Then there are: 

(i) An open neighborhood V of {a, b) in Rp+", contained in Lf; 

(ii) An open neighborhood W of (a,c) in Rp+"; 

(hi) A unique function G : IF —>■ R” satisfying G{a, c) = b and, for all {x, z) G 
W, (x, G(x, z)) G V and 

F{x, G{x, z)) = z . 

Additionally, G is everywhere differentiable on IF, and for all (x, z) G IF, 


DG{x, z) 


-DyF{x,G{x,z)) ^D,„F{x,G{x,z)) : DyF{x,G{x,z)) ^ 


Note that these theorems are technically not generalizations of 13.11 since both 
the hypotheses and conclusion are weaker. 
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5. Continuous and Mixed Versions 
There is a continuous version of the usual Implicit Function Theorem: 

Theorem 5.1. [Implicit Function Theorem, Continuous Version] Let U 
be an open set in K.P+"’ and {a,b) € U. Let F : U ^ K." with F{a^b) = 0. 
Suppose F is continuous on U and that DyF exists and is continuous on U, i.e. 
that exists and is continuous on U for 1 < j < n. If DyF [a,b) is invertible, 
i.e. det{DyF[a,b)) ^ 0, then there are: 

(i) An open set V in containing (a, b), with V Q U, such that DyF(x, y) 

is invertible for all {x, y) S V; 

(ii) An open set W in containing a; 

(hi) A unique function f : W ^ K" such that f{a) = b and, for all x € W, 
(x, f(x)) G V and 

F{x,f{x)) = 0 . 

Additionally, f is continuous on W. 

There is also a mixed version where F is differentiable with respect to some, 
but not necessarily all, of the ^-variables as well as all the y-variables. We will 
notationally divide the t- variables into q s-variables and m <-variables, where q + 
m = p, where the t-variables are the ones for which F is differentiable. We also 
say a function of y and possibly some additional variables is C’’ (r > 1) in y if all 
partial derivatives in the y-variables of order < r exist and are continuous. 

Theorem 5.2. [Implicit Function Theorem, Mixed Version] Let U be an 
open set in ^9+™+” and {a,d,b) € U. Let F ■. U ^ M" with F{a,d,b) = 0. 
Suppose F is continuous on U and on U in {t,y) for some r, 1 < r < oo. If 
DyF{a,d,b) is invertible, i.e. det{DyF{a, d,b)) ^0, then there are: 

(i) An open set V in ]R9+™+" containing {a,d,b), with V C U, such that 
DyF{s,t,y) is invertible for all {s,t,y) G V; 

(ii) An open set W in containing {a,d); 

(hi) A unique function f : W ^ M" such that f{a,d) = b and, for all {s,t) G 
W, {s,t, f{s,t)) G V and 

F{s,t,f{s,t)) = 0 . 

Additionally, f is in t on W, and for any {s,t) G W, we have 

Dtf{s,t) = -DyF{s,t, f{s,t))-^DtF{s,t, f{s,t)) 

(in particular, DyF{s,t, f{s,t)) is invertible for all {s,t) GW). 

The usual Implicit Function Theorem (I2.1|l is the case g = 0 and the continuous 
version (|5.1I) is the case m = 0. The continuous and mixed versions of the Implicit 
Function Theorem cannot be proved directly from the Inverse Function Theorem, 
and thus the mixed version is more general. Some of the usual proofs of the Implicit 
Function Theorem such as the proof by induction on n (which was historically the 
first proof) work essentially verbatim to give the mixed version (cf. |Bla) 1. We then 
get a general Mixed Implicit/Inverse Function Theorem by the same argument as 
before: 
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Theorem 5.3. [General Implicit/Inverse Function Theorem, Mixed Ver¬ 
sion] Let r G N, (j,m G NU {0}, a G d G M"*, b,c G M", U an open neigh¬ 
borhood of {a,d,b) in p - jj ^ ^ continuous function which is C’’ 

(1 < r < oo) in (t,y), with F(a,d,b) = c and DyF[a,d,b) invertible. Then there 
are: 

(i) An open neighborhood V of (a, d, b) in contained in U, such that 

DyF{s,t,y) is invertible for all {s,t,y) G V; 

(ii) An open neighborhood W of {a,d,c) m 

(iii) A unique function G : IF —> R” satisfying G{a, d,c) = b and, for all 
(s,t,z) G W, (s,t,G(s,t,z)) G V and 

F{s,t,G{s,t,z)) = z . 

Additionally, G is in {t,z) on W, and for all {s,t,z) € W, 


^{t,z)G(^s, t, z'j 


-DyF{s, t, G{s, t, z)) ^DtF{s, t, G{s, t, z)) 


DyF{s,t,G{s,t,z)) 1 


Theorem o is the case = 0, the Inverse Function Theorem is the case q = 
m = 0, and there is a continuous version with m = 0. 

It is unclear whether there are continuous or mixed versions of the theorems of 
Section 4. 
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